Haplotype association analysis of combining unrelated case-control and triads with consideration of population stratification
Authors
Abstract
Combining data collected under different study designs, such as family trios and unrelated case-control samples, yields more power and is more cost-effective than analyzing each data set separately. However, population stratification (PS) among unrelated case-control samples is a potential concern, and analyses that integrate such data should address this confounding effect. In this paper, we develop a simple method, the haplotype generalized linear model (HGLM), that tests and estimates haplotype effects on disease risk and can be modified to adjust for PS when combining data. We propose to combine information by aggregating haplotype weighted counts estimated separately from population case-control data and trio data, and then to perform a GLM analysis. Furthermore, we present an analysis-of-variance framework based on haplotype weighted counts for detecting whether it is appropriate to combine the two data sources, as well as a modified HGLM that uses clustering methods to address PS. We evaluate the statistical properties of the method in terms of accuracy, false positive rate (FPR), and empirical power using simulated data covering various disease risks, sample sizes, multi-SNP haplotypes, and the presence of PS. Our simulation results indicate that HGLM performs comparably to likelihood-based haplotype association analysis, particularly when the haplotype effects are moderate, but may not perform well with long haplotypes and small sample sizes. In the presence of PS, the modified HGLM remains valid, holds the nominal level satisfactorily, and has small bias. Overall, HGLM appears to be successful in combining data and is simple to implement in standard statistical software.
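The general idea of aggregating haplotype weighted counts from both designs and then fitting a GLM can be illustrated with a minimal sketch. This is not the authors' implementation: the counts are made-up illustrative numbers, the names (`cc_case`, `trio_transmitted`, `fit_binomial_glm`) are hypothetical, and treating transmitted trio haplotypes as "cases" and non-transmitted ones as pseudo-controls is an assumption of this sketch rather than something the abstract states. The model is a plain logistic (binomial) GLM with the first haplotype as the reference, fitted by iteratively reweighted least squares.

```python
import numpy as np

def fit_binomial_glm(X, successes, totals, n_iter=50, tol=1e-10):
    """Fit a logit-link binomial GLM by iteratively reweighted least squares."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        eta = X @ beta
        p = 1.0 / (1.0 + np.exp(-eta))
        w = totals * p * (1.0 - p)                # IRLS working weights
        z = eta + (successes - totals * p) / w    # working response
        xtw = X.T * w
        beta_new = np.linalg.solve(xtw @ X, xtw @ z)
        converged = np.max(np.abs(beta_new - beta)) < tol
        beta = beta_new
        if converged:
            break
    # Standard errors from the inverse Fisher information at the estimate
    p = 1.0 / (1.0 + np.exp(-(X @ beta)))
    w = totals * p * (1.0 - p)
    se = np.sqrt(np.diag(np.linalg.inv((X.T * w) @ X)))
    return beta, se

# Hypothetical weighted counts for three haplotypes (weights may be fractional
# because haplotype phase is estimated, not observed).
cc_case = np.array([120.5, 60.2, 19.3])          # unrelated cases
cc_ctrl = np.array([130.0, 45.0, 25.0])          # unrelated controls
trio_transmitted = np.array([55.0, 35.5, 9.5])   # assumed to play the role of cases
trio_untransmitted = np.array([60.0, 28.0, 12.0])  # assumed pseudo-controls

# Aggregate the two data sources by summing weighted counts per haplotype.
case = cc_case + trio_transmitted
ctrl = cc_ctrl + trio_untransmitted

# Design matrix: intercept plus indicators for haplotypes 2..H (haplotype 1 = reference).
H = case.size
X = np.column_stack([np.ones(H), np.eye(H)[:, 1:]])

beta, se = fit_binomial_glm(X, case, case + ctrl)
for h in range(1, H):
    print(f"haplotype {h + 1}: log-OR = {beta[h]:.3f}, SE = {se[h]:.3f}")
```

Because this toy model has one parameter per haplotype, the fitted coefficients coincide with the log odds ratios computed directly from the aggregated counts. Adjusting for PS in this setting would mean adding further covariates (for example, cluster membership) to the design matrix, which the sketch does not attempt.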
Journal title:
Volume 5, issue
Pages -
Publication date: 2014